Machine-Readable Dictionaries, Lexical Data Bases And The Lexical System

Author

  • Nicoletta Calzolari
Abstract

I should like to raise some issues concerning the conversion from a traditional Machine-Readable Dictionary (MRD) on tape to a Lexical Data Base (LDB), in order to highlight some important consequences for computational linguistics which can follow from this transition. The enormous potentialities of the information implicitly stored in a standard printed dictionary or an MRD can only be evidenced and made explicit when the same data are given a new logical structure in a data base model, and exploited by appropriate software. A suitable use of DB methodology is a good starting point to discover several kinds of lexical, morphological, syntactic, and semantic relationships between lexical entries which would otherwise have remained unexploited. Moreover, the transformation of a "very large-scale" MRD into an LDB provides the means of operating throughout the lexicon in a really extensive manner. I think in fact that an "almost exhaustive" approach to lexical facts is essential both for reliable investigations of a lexical system, and for many kinds of linguistic applications which cannot be restricted to a particular domain of discourse. The possibility of abstracting significant regularities from recurrent patterns of natural language definitions by means of suitable computational methods, and of reaching a formalization of a number of important structuring relations within the lexicon will be discussed. An overview of the "associative links" already produced in the Italian LDB, and of other allowable interconnections will be given. In a "relational" organization of a computerized dictionary with complex interlinked structures, each word acquires its meaning as a result of its position in some of the partitionings created by the formalized relations. When an entry is activated, all of its relations with other entries can be activated, too. Conversely, when a relation is activated, all of its linked concepts are made immediately available.
Conceptual and linguistic information at many levels is thus interactively retrievable from the LDB following the appropriate pointers. I shall especially take into consideration those types of relations which can be of relevance not only for "Computational Lexicology" research, but also in a more general Computational Linguistics framework. An example is provided by derivational relationships which, when formalized, give rise to families of semantically and syntactically connected entries, linked to the same base-word node, and substitutable in different syntactic formulations of the same conceptual meanings. Another example concerns case or argument relations, both (a) between lexical items, and (b) governed by lexical …
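
The two-way activation described in the abstract (from an entry to its relations, and from a relation to its linked entries) can be pictured with a small sketch. This is only an illustration under stated assumptions, not the structure of the Italian LDB: the class name `LexicalDB`, its methods, the relation labels `derived_from` and `is_a`, and the English sample entries are all hypothetical.

```python
from collections import defaultdict


class LexicalDB:
    """Toy relational lexicon (hypothetical; not the actual Italian LDB)."""

    def __init__(self):
        # entry -> set of (relation, target) pairs
        self._by_entry = defaultdict(set)
        # relation -> set of (source, target) pairs
        self._by_relation = defaultdict(set)

    def link(self, source, relation, target):
        """Record one labelled link and index it in both directions."""
        self._by_entry[source].add((relation, target))
        self._by_relation[relation].add((source, target))

    def relations_of(self, entry):
        """Activating an entry: return all of its outgoing links."""
        return sorted(self._by_entry.get(entry, ()))

    def pairs_in(self, relation):
        """Activating a relation: return every pair of entries it links."""
        return sorted(self._by_relation.get(relation, ()))

    def family(self, base, relation="derived_from"):
        """Entries attached to the same base-word node by one relation."""
        pairs = self._by_relation.get(relation, ())
        return sorted(source for source, target in pairs if target == base)


# Illustrative data only (assumed, not taken from the paper).
db = LexicalDB()
db.link("teacher", "derived_from", "teach")
db.link("teaching", "derived_from", "teach")
db.link("teacher", "is_a", "person")

teacher_links = db.relations_of("teacher")
derivations = db.pairs_in("derived_from")
teach_family = db.family("teach")
```

Indexing each link twice trades some storage for direct lookup in either direction, which is one simple way to make both kinds of activation immediate.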

Similar resources

An Approach to Building the Hierarchical Element of a Lexical Knowledge Base from a Machine Readable Dictionary

This abstract describes an approach to extracting taxonomies from machine readable dictionaries and using them to structure a lexical knowledge base which incorporates default inheritance. Taxonomy construction is based on an intuitive notion of the organisation of the substantial quantities of data in machine readable dictionaries which were developed for quite independent purposes. Our intent...

Multilingual Lexical Knowledge Bases: Applied WordNet Prospects

The idea of a Lexical knowledge base was recently proposed by the ESPRIT BRA AQUILEX [Briscoe 91], [Calzolari 92] project, to provide information, mostly of a semantic nature, internally consistently structured and electronically available. Three levels of lexical representation are proposed in AQUILEX: (a) Machine Readable Dictionary (MRD), i.e. an electronic version of the paper dictionary; (...

Creating And Querying Lexical Data Bases

Users of computerized dictionaries require powerful and flexible tools for analyzing and manipulating the information in them. This paper discusses a system for grammatically describing and parsing entries from machine-readable dictionary tapes and a lexical data base representation for storing the dictionary information. It also describes a language for querying, formatting, and maintaining di...

DRAFT Automatic Creation of Lexical Knowledge Bases: New Developments in Computational

Text processing technologies require increasing amounts of information about words and phrases to cope with the massive amounts of textual material available today. Information retrieval search engines provide greater and greater coverage, but do not provide a capability for identifying the specific content that is sought. Greater reliance is placed on natural language processing (NLP) technolo...

LINGUISTIC DESCRIPTION IN DICTIONARIES: SEMANTICS Towards a filtering of the relevant semantic information from MRDs

Machine-readable dictionaries (henceforth MRDs) are a valuable source of lexical data for the automatic construction of lexical knowledge bases (LKBs). However, the transfer of information from one source to the other one often demands a filtering process, since in many cases the original dictionaries contain mistakes, redundancies and granularity irregularities. We claim that this filtering pr...

Book Reviews: Machine Translation and the Lexicon

The practical success of machine translation (MT) depends on the ability to acquire, share, and manage lexical data. Rather than reinventing lexicons for each new system and application, it is preferable to leverage common lexical resources. Increasingly, researchers are using pre-existing resources such as machine-readable dictionaries (MRDs) and corpora to acquire lexicons and term banks for ...

Publication date: 1984